(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 
International Bureau 

(43) International Publication Date 
28 March 2002 (28.03.2002) 




PCT 



(10) International Publication Number 

WO 02/25233 A2 



(51) International Patent Classification 7 : G0IJ 3/00 

(21) International Application Number: PCI7US0 1/25706 

(22) International Filing Date: 16 August 2001 (16.08.2001) 

(25) Filing Language: English 

(26) Publication Language: English 

(30) Priority Data: 

09/664,973 1 8 September 2000 (1 8.09.2000) US 

(71) Applicant: INSTRUMENTATION METRICS, JNC. 

| US/US]; 7470 West Chandler Blvd., Chandler, AZ 85226 
(US). 

(72) Inventors: HAZEN, Kevin, H.; 1534 W. Islandia Drive, 
Gilbert, AZ 85233 (US). BLANK, Thomas, B.; 2922 
E. Tulsa Slrccl, Chandler, AZ 85225 (US). MONFRE, 
Stephen, L.; 1289 E. Palo Blanco Way, Gilbert, AZ 85296 
(US). RUCHTI, Timothy, L.; 1501 West Sea Haze Drive, 
Gilbert, AZ 85233 (US). 



(81) Designated States (national): AE, AL, AM, AT, AU, AZ, 
BA, BB, BG, BR, BY, CA, CM, CN, CR, CU, CZ, DE, DK, 
DM, EE, liS, PI, GB, GD, GE, Gil, GM, MR, HU, ID, 1L, 
IN, IS, JP, KE, KG, KP, KR, KZ, IX, LK, LR, LS, LT, LU, 
LV, MA, MD, MG, MK, MN, MW, MX, NO, NZ, PL, IT, 
RO, RU, SD, SIS, SG, SI, SK, SL, TJ, TM, TR, TT, TZ, UA, 
UG, UZ, VN, YU, ZA, ZW. 

(84) Designated States (regional): ARIPO patent ((HI, GM, 
KB, US, MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 
patent (AT, BE, CM, CY, DE, DK, BS, PJ, PR, GB, GR, IK, 
IT, LU, MC, NL, PT, SE, TR), OAPI patent (BP, BJ, CP, 
CG, CI, CM, GA, GN, GQ, GW, ML, MR, NE, SN, TD, 
1X5). 

Published: 

— without international search report and to be republished 
upon receipt of that report 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



(74) Agents: GLENN, Michael ct al.; Glenn Patent Group, Sle. 
L., 3475 Edison Way, Mcnlo Park, CA 94025 (US). 



< 

m * — 

fT) (54) Title: A METHOD OP CHARACTERIZING SPPCPROM ETER INSTRUMENTS AND PROVIDING CALIBRATION 
MODELS TO COMPENSATE POR INSTRUMENT VARIATION 

in 

^ (57) Abstract: Spectrometer instruments are characterized by classifying their spectra into previously defined clusters. The spectra 
arc mapped to the clusters and a classification is made based on similarity of cxiracied spectral features to one of Ihe previously 
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A METHOD OF CHARACTERIZING SPECTROMETER 
INSTRUMENTS AND PROVIDING CALIBRATION MODELS TO 
COMPENSATE FOR INSTRUMENT VARIATION 

5 

BACKGROUND OF THE INVENTION 
FIELD OF THE INVENTION 

10 

The invention relates to variation in spectrometer instruments. More particularly 
the invention relates to characterizing spectrometer instruments by classifying their 
spectral responses into a limited number of clusters and developing calibration 
transfer models between clusters that compensate for instrument variations. 
15 * 

DESCRIPTION OF THE PRIOR ART 

Many of the analytical applications for spectrometers require calibration data sets 
that are time-consuming and expensive to create. Typically, these calibrations are 

20 highly specific. For example, apparently identical instruments produced by the 
same manufacturer may exhibit minor instrument variations; such variations may 
be seen when one instrument is built with a component that varies slightly from 
the same component in another instrument. In addition, a calibration set for an 
instrument produced by one manufacturer is generally not suitable for a similar 

25 . instrument produced by another manufacturer. Furthermore, repairs to a single 
instrument can cause the instrument's spectral response to vary. As an instrument 
ages, it's spectral response may change. An instrument's spectral response may 
vary according to fluctuations in the operating environment. In applications 
requiring analysis of very low concentration analytes, non-invasive blood glucose 

30 prediction, for example, even minor instrument variation can introduce an 
unacceptable degree of error into the analysis. Providing another calibration 
model that takes the instrument's current spectral response into account can 
compensate for instrument variation. However, development of new calibration 
models is time-consuming, labor-intensive and costly. 

35 

In the development of spectroscopy-based analyzers for biomedical 
applications, there is a need for production of thousands to as many as millions of 
analyzers for a specific application. No methodology exists for providing 
calibrations for large numbers of instruments quickly and inexpensively. 
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Therefore, efforts have been directed at transferring calibrations from one 
analyzer to another. See, for example, E. Bouveresse, C. Harimann, D. 
Massart, I. Last, K. Prebble, Standardization of near-infrared spectrometric 
instruments, AnaL Chem., vol. 68, pp. 982-990 (1996) and M. Defernez, R. 
Wilson, Infrared spectroscopy: instrumental factors affecting the long-term validity 
of chemometric models, AnaL Chem. . vol. 69, pp. 1288-1294 (1997), and E. 
Bouveresse, D. Massart, P. Dardenne, Calibration transfer across near-infrared 
spectrometric instruments using Shenk's algorithm: effects of different 
standardization samples. Analvtica Chimica Acta , vol. 297, pp. 405-416, (1994) 
and Y. Wang, D. Veltkamp, B. Kowalski, Multivariate instrument calibration, Anal. 
Chem. . vol. 63, pp. 2750-2756 (1991). 

Most of the reported methods of calibration transfer have been applied h 
situations involving high-concentration analytes, wherein the signal-to-noise ratio is 
high. Because these currently known methods act as a smoothing function when 
transferring calibrations, they degrade the signal to noise that can be observed, 
thus hindering analysis of low concentration analytes. Additional problems of 
changes in resolution or bandwith across time or between instruments have not 
been addressed. 

Furthermore, the currently known methods have only been successfully applied 
in situations involving a small number of instruments. The reported methods are 
not capable of modeling the complexity encountered when large numbers of 
instruments are involved. 

A need exists for the calibration of large numbers of analyzers. It would be 
desirable to provide a means of reducing the complexity inherent in the transfer 
of calibrations to large numbers of analyzers. It would also be advantageous to 
provide a means of transferring calibrations without significant degradation of the 
signal-to-noise ratio, rendering calibration transfer practical in analysis of low 
concentration analytes. 



SUMMARY OF THE INVENTION 

The invention provides a method of characterizing large numbers of 
spectrometer instruments by classifying their spectra into a limited number of 
previously defined clusters according to similarity of spectral features and 
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performance characteristics. The method may also be used to track variation over 
time within a single instrument. The spectral features used for classification may 
be related to known instrument parameters, or they may be abstract features 
derived using a variety of computational methods. The clusters are defined h 
5 advance based on an exemplar data set, using either supervised or 
unsupervised methods. Calibration models for each cluster compensate for 
instrument variation, either across instruments or across time within a single 
instrument. 

0 In a preferred embodiment of the invention, calibration models are provided 
using a method of calibration transfer wherein the clusters are mapped to each 
other, so that a calibration transferred from one cluster to another need only model 
the difference between the two clusters. In an alternate embodiment, a different 
calibration is separately calculated for each cluster. In either embodiment, the 

5 number of calibration transfers is significantly reduced since each cluster 
represents a multitude of instruments. 



BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 provides a block schematic diagram of a method of characterizing 
spectrometer instruments by clustering according to spectral features, according 
to the invention; 

Figure 2 provides a flow diagram of a method of generating calibration models 
by transferring a master calibration to slave calibration models, according to the 
invention; 

Figures 3A and 3B illustrate decreases in spectral cutoff related to decreases h 
detector temperature in a spectrometer instrument; and 

Figure 4 illustrates changes in light throughput related to changes in source 
temperature in a spectrometer instrument. 

DETAILED DESCRIPTION 

The invention provides a method of characterizing spectrometer instruments b y 
classifying spectral responses from a large number of spectrometers (analyzers) 
into a limited number of previously defined clusters. The clusters constitute 
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groups that are defined based on the similarity of specific features. Grouping of 
instruments according to spectral features and performance characteristics reduces 
the spectral variation between instruments in a given group. Therefore, spectral 
measurements corresponding to an individual cluster are more homogeneous 
5 than those from the entire group of analyzers. Calibrations created for each cluster 
may then be utilized for subsequent analysis. The calibration created for a cluster 
; will have instrument variations included in the model that mirror variations resulting 
from the particular analyzer employed for the subsequent analysis. Calibration 
models specific to the clusters are expected to be less complex and have an 
10 improved level of accuracy. Therefore, multivariate analysis will require fewer 
factors to model instrument variation resulting in earlier factors modeling the 
sample and earlier factors modeling small absorbing analytes. The net result is 
that fewer factors are required for sample analysis and a more robust algorithm is 
created. 

15 

In one embodiment of the invention, a separate calibration data set is required for 
every cluster, requiring a large number of samples in a calibration set for every 
cluster. In the preferred embodiment, sufficient samples for a calibration model 
are only required in a single cluster. Principal features defining that cluster and 
20 other clusters are determined, and the clusters are mapped to each other, 
eg revealing the specific differences from one cluster to another. Subsequently, the 
calibration from the initial cluster is transferred to another cluster based upon the 
specific differences between clusters. 

25 The parent application to the current application, S. Malin and T. Ruchti, An 
intelligent system for noninvasive blood analyte prediction, U.S. Patent 
Application Ser. No. 09/359,191 (July 22, 1999), provides a detailed 
description of a method of classifying spectral measurements into previously 
defined clusters through similarities observed in absorbance spectra. The 

30 classification system of the sampled tissue volume of the subjects is herein 
expanded to include classification of instrumentation variations. 

Referring now to Figure 1, a general method of classifying spectral 
measurements into previously defined clusters is represented. Briefly the steps 
35 of the method are: 



1. Measurement. (11) 
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2. Classification (12), in which the measured spectrum is assigned 
membership in any one of a number of predefined clusters 13. 

3. Calibration (14), in which calibration models suited to each cluster are 
5 provided. 

4. Identifying outliers (15) 
MEASUREMENT 

In general, instrumental variations may affect spectral response by producing 
10 either signal intensity changes, bandwidth changes, wavelength changes, or 
combinations thereof. These instrumental variations may include: 

1 . wavelength shifts; 

2. nonlinear wavelength shifts; 

3. wavelength expansions; 
15 4. wavelength contractions; 

5. nonlinear wavelength expansions; 

6. source intensity drifts; 

7. blackbody profile changes; 

8. bandwidth changes; 
20 9. resolution changes; 

10. baseline deviations; 

1 1 . change with time; 

12. temperature effects; 

13. detector response; 

25 14. differences in optical components (e.g. long-pass filters or fiber optics); 

15. variation related to mounting of references; 

1 6. differences in the optical interface to the sample (fiber spacing); 

17. linearity; 

18. detector cut-off; 

30 and many others, which will be apparent to those skilled in the art. Spectra used 
for classification will typically be those of commonly known standards. Standards 
particularly useful for classifying shifts observed in the wavelength axis include 
polystyrene, rare earth oxides: holmium oxide, erbium oxide or dysprosium 
oxide, for example; or combinations thereof. Standards such as the diffuse 

35 reflectance standards supplied by Labsphere, Inc. (North Sutton NH) may be 
used to classify shifts in the intensity axis. Spectra of samples may also be used 
for classification. Additional intensity and wavelength standards will be known to 
those skilled in the art. 
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In the specific case of near-IR noninvasive glucose determination these standards 
cover the near-IR spectral region. Additional spectroscopy and chromatography- 
spectroscopy hyphenated techniques such as AA (atomic absorption 
5 spectroscopy) or GC-MS (gas chromatography mass spectroscopy) will each 
require their own standards which are known by those skilled in the art. 
Additionally, spectra of tissue phantoms collected on an instrument may also b e 
used to characterize that instrument. Tissue phantoms are helpful in characterizing 
an instrument because spectra of these phantoms simulate noninvasive spectra 
10 of living tissue. There are tissue phantoms that simulate the absorption 
coefficients of various skin tissues; while others simulate the scattering coefficients 
of the body. Some common simulants include: 

1 . milk and milk products 

2. milk products in combination with India ink, used to adjust the absorption 
15 coefficient; 

3. emulsions of fatty substances in water maintained in solution with an 
emulsifiersuchas lecithin. One such commercial product is INTRALIPID, 
supplied by Kabivitrum AB (Stockholm, Sweden); 

4. Intra-serum and Intra-gel. See K. Hazen, J. Welch, S. Malin, T. Ruchti, A 
20 Lorenz, T. Troy, S. Thenadil, T. Blank, Intra-serum and intra-gel for 

modeling human skin tissue, U.S. Patent Application Ser. No. 
09/502,877 (February 10, 2000). 

Other scattering and absorbing mediums are known to those skilled in the art. 
25 Concentrations of these tissue simulants may be adjusted to match the 
scattering and absorption coefficients of body tissues such as skin or internal 
organs. 

It will also be apparent that a set of exemplar measurements is required for 
30 cluster definition and development of calibration models, in addition to the actual 
spectral measurements utilized for classification. 

CLASSIFICATION 

New spectral measurements are passed to a pattern classification system that 
35 classifies the measurements into previously defined clusters having a high 
degree of internal constistency through spectral features related to instrumental 
variation. 

The classification itself includes the following steps: 
1 . Feature extraction; and 
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2. Classification of features according to a classification model. 
FEATURE EXTRACTION 

The process of feature extraction is more fully described in the parent application 
5 to the current application, U.S. Patent Application Ser. No. 09/359,191, supra. 
Feature extraction is any mathematical transformation that enhances a particular 
aspect or quality of the data that is useful for interpretation. Features may be of 
two categories: 

0 1 . Abstract, and 

2. Simple. 

Instruments may be classified by abstract features, meaning that they may be 
classified using computational methods. These methods may be supervised or 
5 unsupervised. Examples include plotting primary principal components versus 
one another and identifying clusters of results; discriminant analysis, such as 
measurement of the Mahalanobis distance, and k-means clustering. Additional 
methods will be readily apparent to those skilled in the art. 

It is important to note that the clustering techniques listed above are not mutually 
exclusive. Clustering of raw spectra into one or more groups may be achieved 
through one or more of these methods and with combinations of these 
approaches. 

Simple features are derived from an a priori understanding of the system, and 
can be related directly to an instrument parameter or component (or parameters 
or components). For example, the measured bandwith, noise characteristics, or 
linearity and detector cutoffs. 

CLUSTER DEFINITION 

As indicated above, clusters must have been previously defined, using a data 
set of exemplar spectral measurements. Cluster definition is the assignment of 
the measurements in the exploratory data set to clusters. After cluster definition, 
the measurements and class assignments are used to determine the mapping 
from the features to cluster assignment. 

Cluster definition is performed using either supervised or unsupervised 
methods. In the supervised case, clusters may be defined using specific 
knowledge of the system. For example, source intensity and detector 
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temperatures have specific effects on the spectra, as demonstrated further 
below. The use of a priori information in this manner is the first step in supervised 
pattern recognition, which develops classification models when the class 
assignment is known. 

5 

Alternatively, clusters may be defined in an unsupervised manner using abstract 
features such as clustering within plots of principal component scores V versus 
spectral loading y. The result is that within a given cluster, all of the spectra have 
the same characteristics (interferences, instrument variations or sample issues). 
10 Clusters formed from features with physical meaning can be interpreted based 
on the known underlying phenomenon causing variation in the feature space. 

CLASSIFICATION 

Subsequent to class definition, a classifier is designed through supervised 
15 pattern recognition. A model is created, based on cluster definitions, that 
transforms a measured set of features to an estimated classification. The 
classification model is a method for determining a set of similarity measures with 
the predefined clusters. A decision rule assigns membership on the basis of a 
set of measures calculated by a decision engine. 

20. 

W CALIBRATION 

Once the spectra have been classified into clusters, calibration models 14 are 
required for each cluster. Provision of the clusters is by one of two methods. In - 
the preferred embodiment, a master calibration is developed for a first cluster, 
25 subsequently the master calibration is transferred to slave calibrations, one for 
each remaining cluster. An alternate embodiment, described further below, 
individually calculates calibrations for each cluster. 

In the case of spectra collected utilizing a master and slave instrument, the term 
30 "calibration transfer," as commonly used in the art, may have different meanings. 
Calibration transfer may refer to transforming the slave spectral to look like the 
master spectra or vice versa. Additionally, spectra from both the master and the 
slave can be transferred to a common standard spectrum not present in the 
master or slave data sets. Furthermore, calibration transfer can refer to 
35 preprocessing steps, multivariate adjustments of the sample spectra, or 
adjustments to the predicted analyte concentrations based upon standards. 

Referring now to Figure 2, the process of calibration transfer is shown. 
Calibrations 14 are required for each of n clusters. A master calibration 21 is 

-8- 
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calculated for a first cluster; then, to provide calibrations for each of the remaining n 
clusters, the master calibration is transferred to slave calibrations. In this case, 
Calibration transfer refers to the process of transforming spectra on a slave 
instrument to match the characteristics of the master such that the master calibration 
can be applied to the slave spectra. For clusters, calibration transfer refers to the 
process of transforming spectra on a slave instrument to match the characteristics 
of the master such that a new calibration is generated that satisfies the 
specification of a cluster other than the one for which the master cluster was 
developed. The clusters may also be organized into groups of clusters, so that 
the master calibration is transferred to slave calibrations 23, which in turn are 
transferred to the slave calibration for the various clusters within each group. 

Prior art methods of calibration transfer have been unsuccessful at modeling the 
complexity involved in providing calibration models for large numbers of 
instruments. Classifying the spectral measurements in to clusters having a high 
degree of internal consistency reduces the complexity of the problem to a level 
that makes it possible to apply calibration transfer to large numbers of 
instruments. Clustering the acquired spectra into a limited number of subgroups 
allows the calibration transfer issues to be broken up into subsets so that only a 
subset of the overall issues needs to be addressed between any two clusters. 
The result is that within a given cluster, all of the spectra have the same 
characteristics. A calibration is generated for a given cluster using spectra of 
samples containing the limited number of characteristics of that cluster. This 
eliminates having to deal with all of the variations possible in the raw 
measurements, significantly reducing the complexity that needs to be modeled 
by the calibration. Thus, the calibration utilized for a given cluster will have 
instrument variations included in the model that are similar to variations resulting 
from the analyzer employed. As each cluster has its own specification, the 
calibration transfer technique need only deal with the differences between those 
two clusters. For example, if the only difference between the clusters is a linear 
x-axis shift, then the calibration transfer technique need only deal with that 
parameter. This allows a more specific calibration transfer technique to be 
employed that is more robust, resulting in fewer factors, since fewer instrument 
variations need to be modeled. This will result in analysis of lower concentration 
analytes due to retention of signal to noise in the calibration transfer step. 

Typically, calibration transfer techniques have to address instrumenMo-instrument 
variations, such as wavelength shifts, nonlinear wavelength shifts, wavelength 
expansions (contraction), nonlinear wavelength expansions, source intensity 
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drifts, blackbody profile changes, bandwidth (resolution) changes, baseline 
deviations, change with time, temperature effects and others known to those 
skilled in the art. 

5 However, in addition to instrumentation issues in calibration transfer, sample 

composition and sample handling considerations are very important in calibration 
• > transfer. Therefore, every sample analyzed for prediction of results should be an 

interpolation within the matrix space of the calibration data set; otherwise the 
i calibration may not calculate an accurate prediction of analyte concentration. For 
] 0 example, if the calibration data set contains glucose samples ranging from 40 to 

400 mg/dL, then predictions on samples with glucose outside of this range are 

suspect. 

Thus, an important additional benefit of the invented classification algorithm is that 
15 outlier spectra may be identified, as shown in Figure 1. Each cluster has its own 
set of classification requirements. If a spectrum does not fall within the parameters 
of a given cluster, another cluster must be found which has parameters allowing 
analysis of that spectrum. If no cluster is found, then the sample is reported as an 
outlier rather than having a value reported for the analyte that may not be correct. 
20 In this manner, analysis on samples or instruments for which no satisfactory 
r< calibration model has been developed is prevented. 

Spectra classified as outliers may assume two forms. In some instances, the 
outlier spectrum is close to a given cluster. In such a case, conventional calibration 
25 transfer techniques may be applied to this spectrum to transform the spectrum 
such that it falls within one of the clusters for which calibrations exist. In case of 
failure, the spectrum remains classified as an outlier, as will spectra that are 
characterized as gross outliers. 

30 Classification and calibration transfer need not be limited to differences h 
instrumentation; it may also be applied to sample spectra for analyte prediction. 
For example, a cluster may be defined for healthy 18 to 30 year old men. A 
noninvasive glucose model may be built for this narrow demographic. A 
separate cluster may be for 1 8 to 30 year old non-pregnant women. Differences 

35 between the two clusters such as body fat and sampled volume may be 
addressed and a calibration transfer technique may be optimized for those 
differences based on relatively few spectra, thus allowing a glucose model initially 
generated for a narrow demographic to be gradually expanded. 
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10 



The foregoing embodiment does not require calibrations to be built for every 
cluster, and thereby provides the important advantage of sparing the 
considerable time, money and effort required for building a separate calibration 
for every cluster. The actual calibration transfer technique employed may be 
selected from the many algorithms known to those skilled in the art. 

An alternative embodiment classifies spectra into clusters, as in the preferred 
embodiment, and builds individual calibrations for each cluster. The requirement 
for calibration transfer is eliminated at the expense of the requirement for more 
spectra. However, an additional measure of reliability is gained by having 
clusters with well-defined and narrow parameters. As in the previous 
embodiment, fewer instrument variations need to be modeled so that early 
factors can focus on analyte information, resulting in a more robust model. 

15 While the foregoing description of the invention has been directed to 
characterizing different instruments, the invented method also finds application h 
classifying spectra from a single instrument according to known variations h 
instrument components or known environmental variations. Two examples are 
described. 

20 

1. The cutoff of the 2.6 I m InGaAs detector employed in many near-IR 
' analyzers blue shifts as the temperature of the detector decreases. Air 

spectra collected on a NICOLET 860, supplied by the Nicolet Instrument 
Corporation (Madison Wl) with the InGaAs detector ranging from 0 to 

25 -20°C are presented in Figure 3A. The spectral cutoff region from 2400 

to 2600 nm is expanded in Figure 3B. The cut-off decreases as the 
detector temperature decreases. Using reference or air spectra, a simple 
analysis, such as determining the wavelength at which 10% of the peak 
intensity is observed, allows the instrument to be classified as to whether 

30 the detector is properly cooled and to the extent that it was cooled. 

2. As a tungsten halogen source increases in temperature, the blackbody 
emission increases in magnitude and additional light throughput is 
observed. For example, in diffuse reflectance spectra of a 5% 

35 Labsphere diffuse reflectance standard, the overall intensity is observed 

to increase as the source temperature increases, as shown in Figure 4. 
This effect can be used to classify the instrument in terms of the source 
temperature and total light throughput. 
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It is a relatively simple task to characterize the instrument in terms of additional 
components. Examples are slit width, affecting bandwidth or room temperature, 
affecting total light throughput. There are, however, a limited' number of 
5 components to a spectrometer; it has been observed that instruments group 
into a relatively small number of clusters. 

r. Once an instrument, or a given spectrum, is classified into a cluster, the 
appropriate calibration routine can be employed. A single analyzer may be 
10 loaded with multiple calibration routines, as the instrument may vary during its 
lifetime. In fact, environmental factors may cause the appropriate calibration to 
change with every spectrum collected. 

This instrument classification approach was designed for use on noninvasive 
15 glucose analyzers where glucose is measured using diffuse reflectance spectra of 
skin on the human body. However, the same technology can be applied to any 
form of noninvasive analysis including but not limited to: noninvasive analysis of 
albumin, globulin, urea, creatinine, oxygen, hemoglobin and electrolytes 
such as Na + , K + , CI*. The technology may also be utilized in biomedical 
20 applications. 

The classification approach disclosed here finds application in various fields of 
endeavor, the agricultural field, for example. Exemplary agricultural applications 
are: analysis of fat in. milk, protein or moisture analysis in wheat, or analysis of 

25 sugars in fruits. The classification approach may also benefit pharmaceutical 
companies in the analysis of intact tablets or in the characterization of raw 
materials. Finally, petrochemical companies may utilize the method in classification 
of fuels, fuel by-products and in raw fuel materials. In general, this technique is 
useful wherever a large number of analyzers are utilized to quantify analytes h 

30 samples. 

Although the invention is described herein with reference to the preferred 
embodiment, one skilled in the art will readily appreciate that other applications 
may be substituted for those set forth herein without departing from the spirit and 
35 scope of the present invention. Accordingly, the invention should only be limited 
by the claims included below. 
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CLAIMS 



What is claimed is: 

5 

1 . A method of characterizing spectrometer instruments according to instrumental 
variation, comprising the steps of: 

providing standard spectral measurements from at least one spectrometer 
instrument; and 

10 classifying said spectral measurements into predefined clusters on the 

basis of extracted spectral features; and 

providing calibration models for each of said predefined clusters, wherein 
said calibration model compensates for said instrumental variation. 

1 5 2. The method of Claim 1 wherein said instrumental variation comprises any 
of: 

wavelength shifts; 

nonlinear wavelength shifts; 

wavelength expansions; 
20 wavelength contractions; 

nonlinear wavelength expansions; 

source intensity drifts; 

blackbody profile changes; 

bandwidth changes; 
25 resolution changes; 

baseline deviations; 

changes over time; 

temperature effects; 

detector response; 

30 differences in optical components (e.g. long-pass filters or fiber optics); 

variation related to mounting of references; 
differences in the optical interface to the sample (fiber spacing); 
linearity; and 
detector cut-off. 



35 



3. The method of Claim 1, wherein said standard spectra are measured on a 
plurality of spectrometer instruments. 
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4. The method of Claim 1 , wherein said standard spectral are measured on a 
single spectrometer instruments at successive time intervals. 

5. The method of Claim 1 } wherein said classifying step comprises the 
5 steps of: 

extracting features; and 

classifying said features according to a classification model and decision 

rule. 

1 0 6. The method of Claim 5, wherein said feature extraction step comprises 
any mathematical transformation that enhances, a particular aspect or quality of 
data that is useful for interpretation. 

7. The method of Claim 3, wherein said classification model comprises 
1 5 means for determining a set of similarity measures with predefined classes. 

8. The method of Claim 5, wherein said decision rule comprises means for 
assigning class membership on the basis of a set of measures calculated by a 
decision engine. 

20 

:y 9. The method of Claim 4, wherein individual features are divided into two 
categories, said categories comprising: 

abstract wherein said features are extracted using various computational 
methods ; and 

25 simple features that are derived from an a priori understanding of a 

system, wherein said feature is directly related to an instrument parameter or 
component. 

1 0. The method of Claim 7, wherein said abstract features are calculated using 
30 any of: 

plotting primary principal components versus one another and identifying 
resulting clusters; 

discrminant anlysis; and 
k-means clustering. 

35 

1 1 . The method of Claim 5, wherein said classification step further comprises 
the step of employing factor-based methods to build a model capable of 
representing variation in a measured spectrum related to variations in spectral 
response; 



-14- 



WO 02/25233 



PCT7US0J/2570f> 



wherein projection of a measured absorbance spectrum onto said model 
constitutes a feature that represents spectral variation related to instrument 
variation. 

5 12. The method of Claim 5, wherein said classifying step further comprises 
the steps of: 

measuring the similarity of a feature to predefined clusters; and 
assigning membership in a cluster. 

10 13. The method of Claim 5, further comprising the step of: 

assigning measurements in an exploratory data set to clusters. 

14. The method of Claim 13, further comprising the step of: 

using measurements and class assignments to determine a mapping from 
1 5 features to cluster assignments. 

15. The method of Claim 13, further comprising the steps of: 

defining clusters from said features in a supervised manner, wherein each 
set of teatures is divided Into two or more regions, and wherein classes are . 
20 defined by combinations of feature divisions; 

designing a classifier subsequent to class definition through supervised Q 
pattern recognition by determining an optimal mapping or transformation from the 
feature space to a class estimate which minimizes the number of !fj 
misclassifications; and ^9 
25 creating a model based on class definitions which transforms a measured =d 

set of features to an estimated classification. ^ 

< 

1 6. The method of Claim 1 , further comprising the step of providing calibration y$ 
models for analysis of new sample measurements. (UJ 

30 CO 

17. The method of Claim 16, wherein said calibration models model 
differences between said predefined clusters. 

18. The method of Claim 16, wherein a master calibration model is 
35 developed for a first of said clusters from a set of exemplar spectra with 

reference values and pre-assigned classification definitions. 



19. The method of Claim 18, further comprising the step of transferring said 
master calibration model to a plurality of slave calibration models, wherein a slave 
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calibration model is calculated for each remaining cluster, and wherein a transform 
modifies said master calibration model to a slave calibration model in accordance 
with principal features defining each of said classes. 

5 

20. The method of Claim 19, wherein said transferring step comprises the 
: steps of: 

transferring said master calibration model to a first slave calibration model; 
transferring said first slave calibration model to a second slave calibration 

10 model; 

and repeating said transfer from one slave calibration model to another 
slave calibration model, until a calibration . has been provided for each of said 
predefined clusters; 

wherein a transform modifies said transfened calibration models h 
1 5 accordance with principal features defining each of said clusters. 

21 . The method of Claim 1 8, further comprising the step of transferring said 
master calibration model to a plurality of slave calibration models, wherein a slave 
calibration model is calculated for each remaining cluster, and wherein a transform 

20 modifies said slave calibration model to said master calibration model h 
flar accordance with principal features defining each of said classes. 

22. The method of Claim 21, wherein said transferring step comprises the 
steps of: 

25 transferring said master calibration model to a first slave calibration model; 

transferring said first slave calibration model to a second slave calibration 

model; 

and repeating said transfer from one slave calibration model to another 
slave calibration model, until a calibration has been provided for each of said 
30 predefined clusters; 

wherein a transform modifies said transferred calibration models h 
accordance with principal features defining each of said clusters. 

23. The method of Claim 16, wherein a different calibration model is 
35 developed for each class, and wherein said calibration models are developed 

from a set of exemplar spectra, with reference values and pre-assigned cluster 
definitions. 
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24. The method of Claim 23, wherein a spectrum is assigned to one of many 
predefined .clusters for which a calibration model has been developed. 

25. The method of Claim 1 , further comprising the steps of: 
5 providing new spectral measurements; 

comparing said new spectral measurements to each of said pre-defined 
clusters according to extracted spectral features; 

reporting those measurements as outliers for which a matching cluster is 
not found. 

10 

26. A method of developing calibration models for spectral analysis 
comprising the steps of: 

defining clusters from an exemplar data set of spectral measurements, 
wherein said clusters exhibit a high degree of internal similarity; 
15 mapping said clusters to one another, wherein principal features 

distinguishing clusters from one another are determined; 

calculating a calibration model for a first of said clusters, said calibration 
model comprising a master calibration; 

transferring said master calibration to at least one slave calibration, wherein 
20 a slave calibration comprises a calibration derived by applying a transform to 
slave spectra such that the master calibration now models the difference between 
the master cluster and another cluster corresponding to said slave spectra. 
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